MalFSM: Feature Subset Selection Method for Malware Family Classification
Authors
Abstract
Malware detection has been a hot spot in cyberspace security and academic research. We investigate the correlation between the opcode features of malicious samples and perform feature extraction, selection, and fusion by filtering redundant features, thus alleviating the curse of dimensionality and achieving efficient identification of malware families for proper classification. Malware authors use obfuscation technology to generate a large number of variants, which imposes a heavy analysis burden on researchers and consumes a lot of resources in both time and space. To this end, we propose the MalFSM framework. Through the feature selection method, we reduce the 735 opcode features contained in the Kaggle dataset to 16, then fuse them with metadata features (count of file lines and file size) for a total of 18 features, and find that machine learning classification on these features achieves high accuracy. We analyzed and interpreted the selected features. Our comprehensive experiments show that the highest accuracy can reach up to 98.6% with a classification time of only 7.76 s on the Microsoft dataset.
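The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one common way to filter redundant features by correlation and then fuse the survivors with two metadata columns. The function names `drop_redundant_features` and `fuse_metadata`, the 0.9 threshold, and the synthetic data are assumptions for illustration, not the MalFSM implementation.

```python
import numpy as np


def drop_redundant_features(X, threshold=0.9):
    """Greedily keep columns whose |Pearson r| with every kept column is <= threshold.

    Hypothetical helper: the paper's actual selection criterion may differ.
    """
    X = np.asarray(X, dtype=float)
    n_features = X.shape[1]
    # Constant columns have undefined correlation; skip them as uninformative.
    valid = X.std(axis=0) > 0
    if not valid.any():
        return []
    corr = np.zeros((n_features, n_features))
    corr[np.ix_(valid, valid)] = np.abs(
        np.atleast_2d(np.corrcoef(X[:, valid], rowvar=False))
    )
    kept = []
    for j in range(n_features):
        if not valid[j]:
            continue
        # Keep column j only if it is not strongly correlated with any kept column.
        if all(corr[j, k] <= threshold for k in kept):
            kept.append(j)
    return kept


def fuse_metadata(X_opcode, kept, line_counts, file_sizes):
    """Append two metadata columns (file line count, file size) to the kept opcode columns."""
    X_opcode = np.asarray(X_opcode, dtype=float)
    return np.column_stack([X_opcode[:, kept], line_counts, file_sizes])


if __name__ == "__main__":
    # Synthetic stand-in for opcode counts: 4 independent columns,
    # 2 columns that are linear copies of earlier ones, and 1 constant column.
    rng = np.random.default_rng(0)
    base = rng.poisson(5, size=(200, 4)).astype(float)
    X = np.column_stack([base, base[:, 0] * 2, base[:, 1] + 1, np.zeros(200)])

    kept = drop_redundant_features(X, threshold=0.9)
    fused = fuse_metadata(X, kept, rng.integers(100, 5000, 200), rng.integers(1, 900, 200))
    print("kept opcode columns:", kept)
    print("fused feature matrix shape:", fused.shape)
```

The fused matrix can then be passed to any standard classifier. A greedy filter like this depends on column order and on the chosen threshold, so both would need tuning on real opcode counts.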
Similar resources
Feature Selection for Malware Classification
In applying machine learning to malware identification, different types of features have proven to be successful. These features have also been tested with different kinds of classification methodologies and have had varying degrees of success. Every time a new machine learning methodology is introduced for classifying malware, there is the potential for increasing the overall quality of malwar...
Feature Selection and Extraction for Malware Classification
The explosive growth of malware continues to threaten networks and operating systems. The signature-based method is widely used for detecting malware. Unfortunately, it is unable to identify malware variants on the fly. On the other hand, the behavior-based method can effectively characterize the behaviors of malware. However, it is time-consuming to train and predict for each specific family of malware....
A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems
Intrusion detection systems are designed to provide security in computer networks, so that if an attacker gets past other security devices, they can detect and prevent the attack. One of the most essential challenges in designing these systems is the so-called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...
Image Classification Using Feature Subset Selection
Classification technology is essential for fast retrieval in large databases. This paper proposes a combined GA and SVM model for content-based image retrieval. The proposed method is also used to classify similar images from a database. A joint HSV histogram and the average entropy computed from gray-level co-occurrence matrices in the localized image region are employed as input vectors. Genetic ...
Journal
Journal title: Chinese Journal of Electronics
Year: 2023
ISSN: 1022-4653, 2075-5597
DOI: https://doi.org/10.23919/cje.2022.00.038